Deeper Attention to Abusive User Content Moderation
نویسندگان
چکیده
Experimenting with a new dataset of 1.6M user comments from a news portal and an existing dataset of 115K Wikipedia talk page comments, we show that an RNN operating on word embeddings outpeforms the previous state of the art in moderation, which used logistic regression or an MLP classifier with character or word n-grams. We also compare against a CNN operating on word embeddings, and a word-list baseline. A novel, deep, classificationspecific attention mechanism improves the performance of the RNN further, and can also highlight suspicious words for free, without including highlighted words in the training data. We consider both fully automatic and semi-automatic moderation.
منابع مشابه
Graph-Based Features for Automatic Online Abuse Detection
While online communities have become increasingly important over the years, the moderation of user-generated content is still performed mostly manually. Automating this task is an important step in reducing the financial cost associated with moderation, but the majority of automated approaches strictly based on message content are highly vulnerable to intentional obfuscation. In this paper, we ...
متن کاملImpact Of Content Features For Automatic Online Abuse Detection
Online communities have gained considerable importance in recent years due to the increasing number of people connected to the Internet. Moderating user content in online communities is mainly performed manually, and reducing the workload through automatic methods is of great financial interest for community maintainers. Often, the industry uses basic approaches such as bad words filtering and ...
متن کاملImproved Abusive Comment Moderation with User Embeddings
Experimenting with a dataset of approximately 1.6M user comments from a Greek news sports portal, we explore how a state of the art RNN-based moderation method can be improved by adding user embeddings, user type embeddings, user biases, or user type biases. We observe improvements in all cases, with user embeddings leading to the biggest performance gains.
متن کاملEverything in Moderation 1 Everything in Moderation: A case for the balanced moderation of user-generated content on news sites
Moderation of user-generated content on news Web sites is an increasingly relevant and pertinent topic for online news entities. The quality and quantity of user-generated content can either help or hinder the number of audience members a news outlet receives. Considerations such as the amount of resources that can be given to moderation, the types of moderation, the types of usergenerated cont...
متن کاملDetection of abusive messages in an on-line community
Moderating user content in online communities is mainly performed manually, and reducing the workload through automatic methods is of great interest. The industry mainly uses basic approaches such as bad words filtering. In this article, we consider the task of automatically determining whether a message is abusive or not. This task is complex, because messages are written in a non-standardized...
متن کامل